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METHOD OF TRANSPOSON-MEDIATED MUTAGENESIS IN THE 
NEMATODE CAENORHABDITIS ELEGANS 



CROSS-REFERENCE TO RELATED APPLICATION 
[0001] This application is a national stage entry of and claims priority under 35 U.S.C. 
§ 365 to PCT/US00/40091, filed June 1, 2000, designating the United States of America, 
corresponding to PCT International Publication WO 00/073510 (published in English on 
December 7, 2000), which claims the benefit of U.S. Provisional Application No. 60/136,972, 
filed June 1, 1999. 

1. FIELD OF THE INVENTION 

[0002] The present invention relates to methods for generating and identifying 
mutations in the genome of the nematode Caenorhabditis elegans (hereinafter "C elegans"). 
More specifically, the present invention relates to a transgene construct for expression in 
C. elegans, and to methods for regulating mobilization of heterologous or endogenous 
transposons in the C. elegans genome, inserting a heterologous DNA sequence into C. elegans 
germline DNA, and engineering mutations into the C elegans genome. 

2. TECHNICAL BACKGROUND 

[0003] The use of model genetic systems had its beginnings in the earliest days of the 
science of genetics and, as a result of the tremendous value of such systems in understanding 
genetic phenomena, continues in the present. Researchers often use in their work organisms 
which have short life spans, limited space requirements, and relatively small genomes. 
Specifically, certain species of worms, fruit flies, and yeast cells are common subjects of 
research. Using such organisms, researchers may learn the function of the various genes found 
within the DNA of the organisms. One commonly used method is to generate mutations in the 
genome of an organism, followed by selection or screening for those mutations which confer a 
specific property or characteristic to the organism. These mutational studies suggest probable 
functions for the genes in which mutations occur. Mutations often occur when a gene is changed 
in such a way that the product of the gene is altered or nonfunctional. 



2 



[0004] A common method for generating mutations uses transposable elements. 
Transposable elements are segments of DNA which have the ability to "hop" — that is, to be 
excised from their initial position in the DNA and move to a new location. In doing this, a 
transposable element, also known as a transposon, may insert into some portion of a gene, thus 
disrupting or even changing the function of the gene. Further, additional mutations may be 
created by remobilizing the transposon. Since this remobilization often occurs imperfectly, 
changes are created in the DNA sequence, leaving the final sequence different from the original 
sequence. See J.D. Watson, J. Witkowski, M. Gilman, and M. Zoller, Recombinant DNA 
175-190, 439-440 2d. ed. (1996). 

[0005] The P element, a transposable element found in the genes of fruit flies, see, e.g., 
A. C. Spradling, G. M. Rubin, Science 218, 341 (1982); J.D. Watson, J. Witkowski, M. Gilman, 
and M. Zoller, Recombinant DNA 175, 177 2d. ed. (1996). (1996), has been an enormously 
useful tool in Drosophila genetic analysis for two reasons. First, these transposons have been 
used for insertional mutagenesis. Mutagenic insertions constitute molecular tags that are used to 
rapidly clone the mutated gene. L. Cooley, R. Kelley, A. Spradling, Science 239, 1121 (1988). 
Particularly helpful in such studies is the presence of strains that lack any copies of the 
transposon. Second, P elements are used to introduce single copies of foreign sequences into the 
host genome. This feature is particularly useful for the rapid identification of gene expression 
patterns by using enhancer traps. H. J. Bellen, et al., Genes Dev. 3, 1288 (1989). The 
availability of such techniques would be particularly advantageous in studies of the genome of 
the nematode C. elegans. 

[0006] C. elegans is a model system in which genetics can be used to identify genes 
and biological pathways which are conserved between nematodes and vertebrates, and which 
thus constitute potential targets for the treatment of various diseases. C. elegans is particularly 
advantageous for genetic studies because it is easily propagated and because the genetic and 
physical maps of its genome are well-characterized. W. B. Wood, Introduction to C. elegans 
Biology (1988). The characterization of gene structure in C. elegans has become routine, largely 
through the efforts of the C. elegans genome project. The workers involved in this effort have 
cloned the entire genome into cosmid or YAC vectors and have completed the genomic 
sequence. C. elegans Sequencing Consortium, Science 282:2012-2018 (1998); A. Coulson et al., 
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Proc Natl Acad Sci USA Proc. Natl. Acad. Sci. USA 83:7821-7825 (1986); A. Coulson et al., 
Bioessays 13:413-417 (1991); R. Wilson et al., Nature 368:32-38 (1994). 

[0007] Standard mutagenesis in C elegans employs chemical mutagens. After 
generation of a mutant, identification of the gene requires time-consuming genetic mapping 
followed by single gene rescue. Alternatively, transposon-based mutagenesis has been attempted 
using mutant backgrounds like mut-2, but efficiency of transposition is low and not specific for a 
defined transposon class. Further, since the genomes of all G elegans strains contain 
transposons, it is very difficult to identify relevant insertions. Thus, utility of native transposons 
for regulated transposition in C elegans is limited. First, all strains contain multiple copies of 
these transposons and thus new insertions do not provide unique tags. Second, mutator strains 
tend to activate the transposition of several classes of transposons, so that the type of transposon 
associated with a particular mutation is not known. Third, transposition is not regulated and the 
transposon tag can be lost by excision in subsequent generations. Fourth, attempts to regulate 
transposase expression have failed because expression of transgenes in the germline of C 
elegans is very difficult. Although one could theoretically regulate the transposition of a specific 
element by expressing the transposase under the control of a germline-specific promoter, 
transgenic arrays are typically silenced in the germline. W. G. Kelly, S. Xu, M. K. Montgomery, 
A. Fire, Genetics 146, 227 (1997) (1997). 

[0008] Another problem in this field is the difficulty of expressing DNA in the C. 
elegans germline. Current methods, see, e.g., W. G. Kelly et al, Genetics 146:227-238 (1997), 
are not adequate. First, current methods for expressing foreign DNA in the C elegans germline 
do not work for all genes. Second, expression of genes introduced using these methods declines 
over time. 

[0009] Finally, introduction of single copy DNA is not possible using existing 
technology. 

[0010] From the foregoing, it will be appreciated that it would be a significant 
advancement in the art to provide methods that allow regulated expression of foreign DNA in the 
C elegans germline. It would be a further advancement to provide methods that allow germline 
expression of a transgene in C elegans. It would be a further advancement in the art to provide 
regulated expression of such a transgene in the germline, as by regulation using a heat-shock 
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promoter. It would be a further advancement to provide methods of regulating the transposition 
of either endogenous or heterologous transposons in C. elegans. Further, it would be an 
advancement to provide transgene constructs to facilitate germline expression of transgenes and 
regulated transposition of homologous and heterologous transposons. Such compositions of 
matter and methods are disclosed herein. 

3. BRIEF SUMMARY OF THE INVENTION 

[0011] The present invention relates to improved methods for generating and 
identifying mutations in C elegans, and includes methods for introducing heterologous DNA 
into the C. elegans germline and causing its expression. In certain embodiments, a method of the 
present invention comprises the steps of inserting a transgene construct into the C. elegans, 
wherein the construct comprises a heterologous gene operably linked to a promoter and a 3' 
untranslated region of a gene that is expressed in the C. elegans germline; and expressing the 
heterologous gene. In certain embodiments, this method further comprises the removal of all 
bacterial plasmid sequences and repeated sequences from the DNA to be introduced. In certain 
preferred embodiments, a promoter that is active in the C. elegans germline drives expression of 
the transgene. In certain especially preferred embodiments, the promoter is an inducible 
promoter. 

[0012] The present invention further relates to a transgene construct for expression in 
C. elegans which comprises a heterologous gene operably linked to a promoter and a 3' 
untranslated region of a gene expressed in the C. elegans genome. In certain embodiments, the 
transgene construct further comprises a promoter that is active in the germline of C. elegans or a 
promoter that is inducible. 

[0013] The present invention further relates to methods for generating and identifying 
mutations in C elegans. In one embodiment, a method of the present invention comprises the 
introduction and expression of a transposase gene to mobilize either endogenous or heterologous 
transposons. In certain preferred embodiments, the transposons are endogenous Tc3 
transposons. 

[0014] In certain other embodiments, the transposons are heterologous transposons, 
such as the Drosophila mariner element. Controlled mobilization of heterologous transposons 
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allows the generation of mutations, which are tagged by the insertion of the transposon. 
PCR-based techniques permit rapid identification of the transposon insertion that caused the 
mutation. 

[0015] The present invention further relates to methods for introducing single copy 
DNA sequences into C. elegans. In certain preferred embodiments, a method of the present 
invention comprises introducing a transposon comprising a heterologous DNA sequence into a 
G elegans, introducing a transgene construct comprising a transposase gene operably linked to a 
promoter and a 3 ' untranslated region of a gene that is expressed in the C elegans germline, and 
expressing the transposase such that the transposase integrates into a C elegans chromosome as 
a single copy. The transposon may be engineered to introduce a DNA sequence, such as one that 
codes for a reporter gene such as, for example, a green fluorescent protein. The introduced DNA 
sequence may also contain FRT/FLP or CRE/LOX recombination sites. Alternatively, the 
introduced DNA sequence may contain polyadenylation sites or transcriptional terminators. 

[0016] These and other features and advantages of the present invention will become 
more fully apparent from the following detailed description. 

4. SUMMARY OF THE DRAWINGS 

[0017] Figure 1 schematically depicts a method for mutagenesis by controlled 
heterologous transposition. 

[0018] Figure 2 depicts the structure of the pJL44 Mosl transposase expression vector. 

[0019] Figure 3 schematically depicts a method for identifying sequences flanking the 
Mes~l- Mosl insertion site using inverse PCR. 

[0020] Figure 4 depicts the sequence (SEQ ID NO.: NO: 23) of an inverse PCR 
product. Nucleotides in capital letters are from the Mosl transposon. The C elegans flanking 
G e/egafls-flanking g enomic region is in lower case. It matches the Y47C4.A sequence from 
chromosome X available at the Sanger Centre. See the Sanger Centre web site at 
http ://www. sanger.ac.uk/Proj ects/C_elegans/ . 

[0021] Figure 5 depicts the mobilization of Mosl in C. elegance somatic cells. (A) 
Engineering of the Mos transposase encoding sequence. Depicted therein are nucleotides 1-21 
(SEQ ID NO:24), 262-279 (SEQ ID NO:25), and 1039-1058 (SEQ ID NO: 26) of the Mosl 
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gene. Also depicted are the corresponding amino acids 1-4 (SEQ ID NO:27), 85-90 (SEQ ID 
NO:28), and 344-345 of the Mosl protein sequence. Restriction sites were generated at the 5' 
and 3' ends of the coding sequence (new sequence is indicated under the original sequence). The 
endogenous polyadenylation signal (boxed) was disrupted and an artificial intron (SEQ ID 
NO: 18) was introduced in the coding sequence in order to improve transposase expression. See 
A. Fire, S. W. Harrison, D. Dixon, Gene 93, 189 (1990). (B) Localization of Mosl insertions 
into unc-49 and gpa-2 genes after induction of Mos transposase expression in somatic cells. 
Open triangles: insertion sites; black rectangles: coding exons; white rectangle: non coding 
exonic sequence. Arrows: genomic primers used to amplify the insertions. (C) Sequence 
comparison of 22 insertion sites. Insertion sites are oriented relative to the 5' end of the Mosl 
transposon. Sequences that flank Mosl at the right end were identified by PCR. DNA 
purification and PCR were performed as described in H. G. van Luenen, S. D. Colloms, R. H. 
Plasterk, Embo /. 12, 2513 (1993). The primers in Mosl were 0JL88— (S^- 
CGCATGCGGCTTACTCAC (5 -CGCATGCGGCTTACTC AC (SEP ID NO: 4)) first PCR; 
and oJL89 (5 f GGCCCCATCCGATTACCACCTA (5 -GGCCCCATCCGATTACCACCTA 
(SEQ ID NO: 5)) second PCR. Primers in unc-49 were oJL19 

(5 -GCGAAACGCATACCAACTGTA (SEQ ID NO: 6)) first PCR; and oJL20— 
TTCATGCCGAAAAGCAGGCGT (5 -TTCATGCCGAAAAGC AGGCGT (SEP ID NO: 7)) 
second PCR. Primers in gpa-2 were the same as described in H. G. van Luenen, S. D. Colloms, 
R. H. Plasterk, Embo J. 12, 2513 (1993). PCR products were gel-purified and sequenced using 
oJL89 (SEQ ID NG: 5) as a primer, (positive positions on the graph), sequences that flank the 
left end of Mosl were deduced from unc-49 and gpa-2 sequences (negative positions on the 
graph). 

[0022] Figure 6 depicts germline mobilization of Mosl. (A) Mos transposase was 
expressed from an extrachromosomal array using either a glh-2 or a heat-shock promoter. The 
Mosl transposon was contained in an array integrated on chromosome V 
(oxIs25rMosl;rol 6(sd1). (ox!s25 [Mosl\rol-6(sd)l}. The array containing array-containing 
chromosome was balanced by the dpy ll( e 22 4 ) dpy-ll(e224) mutation. In the next generation, 
catastrophic excision of the transgene was observed (indicated as Aoxls) among the progeny. 
(B) Comparison of excision and insertion frequencies using glh-2 and heat-shock (hsp) 



7 



promoters to drive Mos transposase expression in the germline. New Mosl insertions were 
identified by PCR. Specifically, the presence of Mosl was detected through PCR by using two 
primers located in the transposon, oJL102 (SEQ ID NO: 1) and oJL103 (SEQ ID NO: 3). The 
absence o f/), mauritiana flanking D. mauritiana-fldwkinz sequence was checked using oJL102 
(SEQ ID NO: 1) and oJL104 (SEQ ID NO: 2) as described below. In addition, a PCR positive 
control was performed on each DNA sample using oligonucleotides located in the cha-1 gene. 
Recombination events were recognized as Dpy worms also containing Mosl flanked by original 
Drosophila genomic sequences. 

[0023] Figure 7 shows Mosl genomic insertions. (A) Southern blot probed with 
labeled Mosl DNA. Lan e Lanes 1 to 8, strains in which insertions were detected by PCR; 
insertions derived from an extrachromosomal array and-Mes - Mos transposase expressed under 
the h e at shock heat-shock p romoter. Mosl presence was assessed by PCR using two primers 
located in the transposon (oJL102 (SEQ ID NO: 1) and oJL103 (SEQ ID NO: 3)). The absence 
o f D. mauritiana flanking D. mauritiana-flankm^ sequence was checked using oJL102 (SEQ ID 
NO: 1) and oJL104 (SEQ ID NO: 2), while a PCR positive control was performed on each DNA 
sample using oligonucleotides located in the cha-1 gene. The control lane is lin-15(n765) which 
had been used to build transgenic lines. Each lane contains 2 mg o f Bgl II dig e st e d Bgl 
Il-digested g enomic DNA. The Mosl probe (encompassing bases 1 to 173 of the trasnposon) 
transposon) w as synthesized by PCR using th e pBluoGcriptM13+/Mool pBluescriptM13+/Mas7 
plasmid as a template. (B) Distribution of Mosl inserts on the physical map of the C. e l e gans C 
elegans g enome. Black triangles: insertions from an extrachromosomal array. Open triangles: 
insertions from the integrated array oxIs25. Open circle: position of oxIs25, the integrated array 
of Mosl transposons. (C) DNA sequence of Mosl de novo insertions oxTil through oxTi6, 
oxTi8, oxTi9, and oxTill (SEQ ID NOS:29-46). Genomic fragments that flank the transposon 
left end were isolated by inverse PCR and sequenced. A primer was designed in the genomic 
region to the right of the insert and used with a Mosl sp e cific Mosl -specific p rimer to amplify 
and sequence the right end flanking the fragment. At insertion sites TA dinucleotides (bold) 
were duplicated during the process of transposon integration. Lower case: Mosl sequence. 
Uppercase: genomic sequence. Ellipses: omitted sequence. 
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[0024] Figure 8 shows a knock-in strategy wherei n Mosl Mosl excision causes a 
DNA double strand break, after which a transgene containing sequences homologous to the 
excision region pairs with the chromosome. Finally, the mutation contained in the transgene is 
copied into the chromosome. 

[0025] These drawings only provide information concerning typical embodiments of 
the invention and are not th e r e for e not, therefore, to be considered limiting of its scope. 

5. DETAILED DESCRIPTION OF THE INVENTION 

[0026] The present invention relates to novel methods for generating and identifying 
mutations in C. elegans, and includes methods for introducing a transgene into the C. elegans 
germline. The present invention also includes methods for expressing transgenes in the 
C. elegans germline. The present invention also includes a transgene construct for expression in 
C. elegans and methods for generating mutations by regulating the transposition of endogenous 
or heterologous transposons in C. elegans. The present invention also includes methods for 
inserting single copy DNA into a C. elegans genome by introducing a transgene comprising FLP 
Recombination Target (FRT) sites into a C. elegans genome and causing recombination. FLP is 
a site-specific recombinase which efficiently catalyzes recombination between FRT sites that 
have been placed in the genome. K. G. Golic, S. L. Lindquist, Cell 44, 521 (1986). When FRT 
sites are in the same relative orientation within a chromosome, the FLP recombinase excises the 
intervening DNA from the chromosome. See Golic, Kent G., Generating Mosaics By 
Site-Specific Recombination, In Cellular Interactions In Development: A Practical Approach, 
1-31 (D. A. Hartley, ed., Oxford Univ. Press 1993); and Plasterk R.H., Groenen J.T., Targeted 
Alterations of the Ca e norhabditis e l e gans Caenorhabditis elegans Genome by Transgene 
Instructed DNA Double Strand Break Repair Following Tel Excision, EMBO J., 11:287-90 
(1992). Other recombination systems, such as CRE/LOX, could also be used. 

[0027] All publications, patents, and patent applications cited herein are hereby 
incorporated by reference. U.S. Patent Application Serial No. 60/136,972 is hereby incorporated 
by reference in its entirety. 
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DEFINITIONS 

[0028] The term "heterologous" is used herein to include nucleic acid sequences such 
as coding sequences and control sequences that are not normally joined together, and/or are not 
normally associated with a particular cell. Thus, a heterologous region of a construct or vector is 
a segment of nucleic acid within or attached to another nucleic acid molecule that is not found in 
association with this other molecule in nature. For example, a heterologous region of a nucleic 
acid construct could include a coding sequence flanked by sequences not found in association 
with the coding sequence in nature (e.g., synthetic sequences having codons different from the 
native gene). Similarly, a cell transformed with a construct which is not normally present in the 
cell would be considered heterologous for purposes of this invention. The term includes, but is 
not limited to, a DNA sequence from another organism. 

[0029] The term "transgene" is a heterologous sequence that is introduced into an 
organism. The term includes both sequences that integrate into one or more chromosomal 
locations of the organism and sequences that are maintained extrachromosomally, e.g., as 
episomes. 

[0030] The term "regulable expression control element" includes promoters, 
polyadenylation signals, transcription termination sequences, upstream regulatory domains, 
origins of replication, internal ribosome entry sites, enhancers, and the like, which provide for 
the replication, transcription, and translation of a coding sequence in a recipient cell or in a cell 
of an organism. The term promoter refers to a DNA s e qu e nc e , sequence that is capable of 
binding RNA polymerase and initiating transcription of a downstream (3 ' direction) sequence. 
Inducible promoters are promoters which are regulable. Such promoters may be regulated by, 
for example, temperature, small molecules, or developmental stages of an organism. 

[0031] Inducible promoters include heat-shock promoters, which are induced by 
exposure to heat. Inducible also promot e rs promoters also include small molecule-regulated 
promoters. Other inducible promoters include promoters that are induced (or repressed) by 
tetracycline and its derivatives (Gossen & Bujard, Proc. Natl Acad, Sci. USA 89:5547-5551 
(1992)). 

[0032] "Operably linked" refers to an arrangement of elements in which the 
components so described are configured so as to perform their usual function. Thus, control 
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sequences such as regulable expression control elements operably linked to a coding sequence 
are capable of affecting the expression of the coding sequence. The control sequences need not 
be contiguous with the coding sequence so long as they function to direct the expression thereof. 
Thus, for example, intervening untranslated yet transcribed sequences can be present between a 
promoter sequence and the coding sequence and the promoter sequence can still be considered 
"operably linked" to the coding sequence. 

[0033] For the purpose of describing the relative position of nucleotide sequences in a 
particular nucleic acid molecule throughout the instant application, such as when a particular 
nucleotide sequence is described as being situated "upstream," "downstream," "3'," or "5'," 
relative to another sequence, it is to be understood that it is the position of the sequences in the 
"sense" or "coding" strand of a DNA molecule that is being referred to-as-4s- as conventional in 
the art. 

GENERAL METHODS 

[0034] In one embodiment, a method of regulated expression of a heterologous gene in 
cells of the germline of C. elegans comprises the steps of inserting a transgene construct into the 
C. elegans, wherein the construct comprises the heterologous gene operably linked to a promot e r 
promoter, and a 3' untranslated region of a gene that is expressed in the C. elegans g e rmlin e ; 
germline and expressing the heterologous gene. Other embodiments further comprise the use of 
a promoter which is inducible, such as a heat-shock promoter or a tetracycline-regulated 
promoter. Yet other preferred embodiments comprise the removal of substantially all bacterial 
plasmid sequences and repeated sequences from the transgene. In certain other especially 
preferred embodiments, the method of the present invention comprises the addition of the 3' 
untranslated region (UTR) of the glh-2 gene, which is expressed in the C. elegans germline, to 
the 3' end of the transgene. A promoter, such as a glh-2 promoter, which is a g e rmlin e sp e cific 
germline-specific p romoter, may be used to drive expression of the transgene. 

[0035] The present invention further comprises transgene constructs for expression in 
C. elegans which comprises a heterologous gene operably linked to a promoter and a 3' 
untranslated region of a gene that is expressed in the G elegans germline. In certain 
embodiments, the promoter is active in the cells of the germline of C elegans. In other 
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embodiments, the promoter comprises an inducible promoter such as a heat-shock promoter or a 
tetracycline-regulated promoter expressed in the germline. Yet other embodiments comprise the 
removal of substantially all bacterial plasmid sequences and repeated sequences from the 
transgene. In certain preferred embodiments, the method of the present invention comprises the 
addition of a 3' untranslated region (UTR) of a glh-2 gene, which is expressed in the C. elegans 
germline, to the 3' end of the transgene. A promoter, such as a glh-2 promoter, may be used to 
drive expression of the transgene. In still other embodiments, the heterologous gene codes for a 
transposase. In certain preferred embodiments, the heterologous gene is a TC3A transposase 
gene. 

[0036] The present invention also includes methods for generating mutations in the 
genome-e^-a - of C elegans by using controlled mobilization of transposons. In certain 
embodiments, the method for generating mutations in the genome-e£fr of C. elegans comprises 
the steps of introducing a transgene construct comprising a transposase gene which is operably 
linked to a regulable expression control element and a 3 ' untranslated region of a gene that is 
expressed in the C elegans germline into a C. elegans: C. elegans and expressing the 
transposase gene. Such methods allow the generation of mutants in which the mutated genes are 
tagged by the insertion of the transposon. PCR-based techniques permit fast identification of the 
transposon insertion that causes the mutation. Since the C. elegans genome has been entirely 
sequenced, sequencing of the genomic regions that flank the transposon allows immediate 
identification of the mutated gene. 

[0037] In certain embodiments, transposons used in the method of the present invention 
are endogenous transposons. Several different types of endogenous transposons are present in C 
elegans, and these can be mobilized in mutator strains. See, e.g., R. H. A. Plasterk, H. G. A. M. 
van Luenen, in C. elegans II, D. L. Riddle, T. Blumenthal, B. J. Meyer, J. R. Priess, Eds. (Cold 
Spring Harbor Laboratory Press, New York, 1997) pp. 97 116. 97-116. Mutator alleles have 
been useful in cloning C elegans genes, particularly in early studies before the genome project 
reagents were widely available. In certain preferred embodiments, the endogenous transposons 
are Tc3 transposons. In yet other embodiments, the transposase gene is a TC3A transposase 
gene. In still other embodiments, the regulable expression control element is an inducible 
promoter, comprising in some embodiments a heat-shock promoter or a tetracycline-regulated 
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promoter. Yet other embodiments comprise the removal of substantially all bacterial plasmid 
sequences and repeated sequences from the transgene construct. In certain preferred 
embodiments, the method of the present invention comprises the addition of the 3' untranslated 
region (UTR) of the glh-2 gene 5 which is expressed in the C. elegans germline, to the 3' end of 
the transgene. In such embodiments, the regulable expression control element comprises a 
promoter, such as a glh-2 promoter or a heat-shock promoter, which may be used to drive 
expression of the transgene. 

[0038] In other embodiments, mutants may be generated by using controlled 
mobilization of heterologous transposons. Using a heterologous transposon allows researchers to 
tag mutated genes with a sequence that is unique in C. elegans genome. These tagged mutations 
will allow the rapid cloning of the mutated genes. The primary advantage over the endogenous 
transposon scheme is that this method avoids the isolation of irrelevant insertions of the 
endogenous C. elegans transposons. A further advantage is that the expression of the 
heterologous transposase would only mobilize the heterologous element, and thus mutations 
should only be due to insertions of these elements. Additionally, insertions could be stabilized 
by loss of the transposase-expressing construct. 

[0039] In certain preferred embodiments, Mosl, a mariner-like transposon isolated 
from Drosophila mauritiana, is used. M. Medhora et al., Genetics 128:311-318 (1991). See 
generally D. L. Hartl et al., Annu. Rev, Genet 31:337-358 (1997). Mosl is a member of the 
mariner/Tel family and was initially identified in the fruitfly Drosophila mauritiana. J. W. 
Jacobson, M. M. Medhora, D. L. Hartl, Proc. Natl Acad. ScL USA 83, 8684 (1986). Like the 
other members of the mariner/Tel family, Mosl contains a single open reading frame which 
encodes the transposase. The transposase binds to and cleaves at the inverted terminal repeats 
(ITRs) present at each end of the transposon. See, e.g. D. L. Hartl, A.R. Lohe, E. R. Lozovskaya, 
Annu. Rev. Genet 31, 337 (1997); R. H. Plasterk, Z. Izsvak, Z. Ivies, Trends Genet 15, 326o32 
(1999). The Mosl transposase is the only protein necessary for transposition in vitro. L. R. Tosi, 
S. M. Beverly, Nucleic Acids Res. 28, 784 (2000). Because no additional factors are required for 
transposition, the Mosl transposon should be capable of transposition in heterologous species, 
and indeed the transposon has been mobilized in species evolutionarily distant from Drosophila. 
F. J. Gueiros-Filho, S. M. Beverly, Science 276, 1716 (1997); J. M. Fadool, D. L. Hartl, J. E. 
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Dowling, Proc. Natl Acad. ScL USA 95, 5182 (1998); A Sherman, et al., Nat Biotechnol. 16, 
1050 (1998); C. J. Coates, N. Jasinskeine, L. Miyashiro, A. A. James, Proa Natl Acad. ScL USA 
95, 3748 (1998). 

[0040] In other embodiments, the transposase gene comprises restriction sites 5' of the 
start codon, restriction sites 5' of the stop codon, and an artificial intron in the transposase gene 
open reading frame. Other preferred embodiments involve a regulable expression control 
element which comprises an inducible promoter such as a heat-shock promoter or a 
tetracycline-regulated promoter. Yet other preferred embodiments comprise the removal of 
substantially all bacterial plasmid DNA sequences and repeated sequences from the transgene 
construct. In certain preferred embodiments, the method of the present invention comprises the 
addition of the 3' untranslated region (UTR) of the glh-2 gene, which is expressed in the 
C. elegans germline, to the 3' end of the transgene. In such embodiments, the regulable 
expression control element may comprise a promoter, such as a glh-2 promoter, a myo-3 
promoter or a heat-shock promoter, which may be used to drive expression of the transgene. 

[0041] In yet other embodiments, the method of the present invention includes 
engineering the transposon to carry a heterologous DNA sequence into a C. elegans 
chromosome. Certain embodiments of the method of the current invention may comprise the 
steps of introducing a transposon into the C. elegans, wherein the transposon comprises the 
heterologous DNA sequence; introducing a transgene construct into the C. elegans, wherein the 
construct comprises a transposase gene which is operably linked to a promoter and a 3' 
untranslated region of a gene that is expressed in the C. elegans germline; and expressing the 
transposase, such that the transposon integrates as a single copy into a C. elegans chromosome. 

[0042] In some embodiments, the transposon may be modified to contain bacterial 
plasmid DNA sequences. Such sequences may simplify cloning of mutated genes into bacteria 
from C. elegans genomic DNA preparations. In yet other embodiments, the transposon may 
carry a gene useful for selection or screening purposes. 

[0043] In certain preferred embodiments, FRT/FLP or CRE/LOX recombination sites 
could be inserted into the transposon. One of skill in the art would appreciate that an engineered 
transposon carrying such recombination sites would facilitate insertion of single copy DNA into 
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the C. elegans genome. In other embodiments, the transposon could include polyadenylation 
sites or transcriptional terminators. 

[0044] In yet other preferred embodiments, the promoter is inducible. In such 
embodiments, inducible promoters such as a heat-shock promoter, may be used. Yet other 
preferred embodiments comprise the removal of substantially all bacterial plasmid DNA 
sequences and repeated sequences from the transgene construct. In certain preferred 
embodiments, the method of the present invention comprises the addition of the 3' untranslated 
region (UTR) of the glh-2 gene, which is expressed in the C. elegans germline, to the 3' end of 
the transgene. In such embodiments, the regulable expression control element may comprise a 
promoter, such as a glh-2 promoter, a myo-3 promoter or a heat-shock promoter, which may be 
used to drive expression of the transgene. 

6. EXAMPLES 

[0045] The following examples are given to illustrate several embodiments which have 
been made within the scope of the present invention. It is to be understood that these examples 
are neither comprehensive nor exhaustive of the many types of embodiments which can be 
prepared in accordance with the present invention. 

Example I - Mobilization of Endogenous Tc3 Transposons 
[0046] About 15 copies of the Tc3 transposon are present in the genome of the 
wild-type C elegans N2 strain. These transposable elements are inactive in wild-type animals. 
Our goal is to cause specific mobilization of the endogenous Tc3 copies by expressing the TC3 A 
transposase in the germline. New Tc3 insertions will be used as tags to clone the genes which 
they have disrupted. 

1 - TC3A expressed under the ced-9 promoter causes somatic and germline hops : 

[0047] The TC3A transposase gene has been cloned behind a ced-9 promoter. This 
construct has been coinjected with linearized C. elegans genomic DNA and the lin-15(+) 
plasmid into a lin-J5(-) strain and unstable transgenic strains have been obtained. Transposase 
activity was assayed by testing whether the construct could excise a Tc3 element from the 
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unc-22 gene and restore the function of the locus. The ced-9::Tc3A arrays have been crossed 
into unc-22(r750::Tc3)\ Un-15(n765ts) background. Wild-type revertants have been recovered 
from Unc-22 Fl animals, suggesting functional expression of the TC3A transposase. One of 
these extrachromosomal arrays was integrated into a chromosomal location to generate the 
insertion oxIsl7[ced-9::Tc3A; lin-J5(+)]. 

[0048] oxhl7 was mapped on the X chromosome and functionally characterized. 
Un-15(n765ts);oxIsl7[ced-9::Tc3A; lin~15(+)] males were crossed to unc-22(r750::Tc3)\ 
Un-15(n765ts) hermaphrodites. Heterozygous nonUnc nonLin hermaphrodites were cloned and 
allowed to self-fertilize. It was expected that among the progeny of these animals, there would 
be found 1/4 Unc animals homozygous for unc-22(r750::Tc3). Of those Unc individuals, 3/4 
should be either homozygous or heterozygous for oxIsl7 i.e. nonLin. It was observed, however, 
that the Unc nonLin animals were greatl y und e r r e pr e s e nt e d — inst e ad, under-represented; 
instead, there were many more nonUnc individuals. Since the ced-9 gene is ubiquitously 
expressed, it was reasoned that TC3A could be present not only in the germline but also in 
somatic cells and could cause somatic reversion of the Unc phenotype. To test this hypothesis, 
nonUnc nonLin individuals were cloned, assuming that a fraction of them could be homozygous 
for unc-22(r750::Tc3) despite thei r wild typ e wild-type p henotype. Self-progeny of these 
animals were scored. Individuals heterozygous for oxlsl 7 segregated 1/4 Lin animals (which no 
longer expressed the TC3A transposase). In this category, plates were identified in which 100 % 
100% of the Lin worms were Unc while almost 100% of the nonLin were nonUnc. Hence, the 
parent hermaphrodite must have been homozygous unc-22 (r7 50: :Tc3) mutant although its 
phenotype was wild-type. These data demonstrate that c e d 9::Tc3A ced-9::Tc3A causes 
somatic reversion of the unc 22(r750::Tc3) unc-22(r750::Tc3) locus at high frequency. 

[0049] Rare nonUnc Lin were looked for among the Lin animals generated b y s e lf 
f e rtilization self-fertilization of unc-22(r750::Tc3); lin-15(-)\ oxIsl7l+ hermaphrodites to 
determine germline reversion rates. Since the Lin worms had lost oxlsl 7 and had no TC3A 
transposase expressed somatically during development, the only way to revert the Unc 
phenotype was to receive one reverted copy of the unc-22 locus. This reversion event had to 
occur during germline development. Rare nonUnc Lin progeny were identified among Unc Lin 
progeny (experiment #1: 1 nonUnc Lin in 61 Lin total; exp. #2: 2/106; exp. #3: 4/203). It was 
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concluded tha t cod 9::Tc3A ced-9::Tc3A causes an approximately 2% reversion rate of the 
unc 22(r750::Tc3) unc-22(r750::Tc3) locus in the germline. 

2 - Expression of TC3A using a g/A-2 promoter : 

[0050] Since the somatic reversion caused by ced-9::Tc3A causes a discrepancy 
between the phenotype and the genotype of an individual carrying a locus disrupted by a Tc3 
insertion, a TC3A expression vector was designed based on the — g e rmlin e — sp e cific 
germline-specific 2 lh-2 gene (gift of Karen Bennett). A plasmid containing a glh-2 genomic 
fragment is able to rescue the Glh-2 mutant phenotype and is therefore likely to be expressed in 
the germline. The glh-2 open reading frame was deleted and replaced by a multiple cloning site 
to generate an expression cassette that retains glh-2 promoter and 3' untranslated regions. Tc3A 
was inserted to generate glh-2::Tc3A. This construct has been coinjected with linearized 
C. elegans genomic DNA and the lin-15(+) plasmid into a lin-15(~) strain and several unstable 
transgenic strains have been obtained. A plasmid driving strong expression of the Green 
Fluorescent Protein (hereinafter "GFP") in the coelomocytes (gift of Piali Sengupta) has also 
been incorporated also in this array. This allows monitoring of the presence of the array in a 
lin-15(+) background based on GFP expression. 

[0051] As described above, these arrays have been crossed into unc-22(r750:;Tc3)\ 
Un-15(n765ts) background. In contrast to the ced-9::Tc3A experiments, no somatic reversion 
events were observed. Germline reversion events were observed in the progeny of unc-22; 
oxEx[glh-2:: Tc3A] hermaphrodites (experiment #1: 2/1914 total scored animals; experiment 
#2: 5/4312). It was concluded tha t rih 2::Tc3A zlh-2::Tc3A causes a 0.1% reversion rate of the 
unc 22(r750::Tc3) unc-22(r 750: :Tc3) locus in the germline. 

Example 2 - Expression of the Mariner Transposase in C elesans 
[0052] A mutagenesis strategy was also developed that uses the mariner transposon 
from the hornfly (gift of David Lampe and Hugh Robertson). Mariner transposons from 
Drosophila are related to C elegans Tc transposons. In fact, members of the Tdmariner family 
of transposable elements have been identified in a broad range of species. R. H. A. Plasterk & 
H. G. A. M. van Luenen, Transposons, in C. elegans 7/97-116 (D.L. Riddle et al. eds., 1997). 
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Horizontal transfer may be responsible for the broad distribution of this family of transposable 
elements. Horizontal transfer implies that specific host factors are not required for transposition 
and biochemical characterization has borne this supposition out. Purified transposase is able to 
catalyze the transposition of mariner or of Tc elements from a host plasmid to a target plasmid. 
D. J. Lampe et al. EMBOJ EMBOJ. 15:5470-5479 (1996); J. C. Vos et ah. Genes Dcv Genes 
Dev. 10:755-761 (1996). This has enabled researchers to mobilize mariner elements from 
Drosophila in other Dipteran species. T. G. Loukeris et al.. Proc Natl Acad Sci USA Proc. Natl 
Acad. Sci. USA 92:9485-9489 (1995); T. G. Loukeris et al., Science 270:2002-2005 (1995); A. R. 
Lohe & D. L. Hartl, Genetics 143:365-374 (1996). Recently, a mariner element from 
Drosophila has been mobilized in Leishmania, which represents a trans-kingdom transposition. 
F. J. Gueiros-Filho & S. M. Beverly, Science 276:1716-1719 (1997). Thus, it was possible that 
mariner would be active in C. elegans as well. 

[0053] A plasmid encoding the mariner transposase HIMAR1 was received from David 
Lampe and Hugh Robertson. First, the transposase coding sequence was engineered to allow for 
efficient expression in C. elegans. Restriction sites were inserted immediately upstream to the 
start codon and just before the stop codon to facilitate subcloning of the fragment in various 
expression vectors. An artificial intron was inserted in the open reading frame since the presence 
of introns improves the expression level of transgenes in C. elegans. 

[0054] Engineered Himarl was placed under the control of the muscl e sp e cific 
muscle-specific p romoter myo-3. The myo-3 r.Himarl construct was injected with the lin-15(+) 
plasmid into a lin-15(-) strain and unstable transgenic strains obtained. Expression of the 
HMARl transposase was examined first by Western Blot. Extracts were prepared from 
oxExfmvo 3 . .Himarl: oxExfmvo-3:: Himarl \ lin-15{+)J worms, run on a denaturing acrylamide 
gel and transferred to a nitrocellulose membrane. The membrane was probed with previously 
characterized antibodies that recognize the HIMAR1 protein (provided by David Lampe and 
Hugh Robertson). In extracts of transgenic worms, an approximately 42 kD protein which 
corresponds to the expected molecular weight of HEMAR1 was detected. The signal was absent 
from non-transgenic worm extracts. Using the same antibodies, the protein was visualized in situ 
using immunofluorescence on oxEx[myo-3::Himarl\ lin-15(+)] worms. Intense 
immunoreactivity was detected which was restricted to the nuclei of muscle cells. These data 
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indicate that the HIMAR1 mariner transposase is expressed and properly targeted to nuclei in C. 
elegans cells. 



Example 3 - Germline Expression of the Transposase Using the g/A-2 Promoter 
[0055] Generation of heritable Mosl insertions would require expression of the Mos 
transposase in the germline. However, expression of transgenes in the germline of C. elegans is 
not possible using standard techniques. Typically, transgenic worms are generated by injecting 
plasmid DNA into the gonads of C. elegans (C. C. Mello, J. M. Kramer, D. Stinchcomb, V. 
Ambros, Embo J. 10, 3959 (1991)). These fragments then form a simple array of repeated DNA 
segments. Although gene expression is robust in somatic tissues, such simple arrays are not 
expressed in the germline or are silenced after a few generations. Co-injection of genomic DNA 
with plasmid DNA improves germline expression, presumably by preventing tandem repeats in 
the array. W. G. Kelly, S. Xu, M. K. Montgomery, A. Fire, Genetics 146, 227 (1997). To 
express the Mos transposase in the germline, an expression vector containing the promoter and 
the 3' UTR of the glh-2 gene was built. This gene encodes a germline helicase which is 
specifically expressed in the gonad. M. E. Gruidl, et al., Proc. Natl Acad, Sci. USA 93, 13837 
(1996). Transgenic lines carrying extrachromosomal arrays of the glh-2:: Mos transposase 
construct were generated by microinjection. To maximize expression in the germline, constructs 
were isolated from plasmid vector sequences and were coinjected with fragmented genomic 
DNA. (The Mos transposase coding sequence was introduced between the promoter and the 3' 
UTR of glh-2. Specifically, this construct (pJL9) contains 2.2 kb of the glh-2 genomic sequence 
immediately upstream of the translation start site (nt 29,882 to 32,095 in cosmid C55B7), an 
Mlu I-Nhe I cloning site, and 0.8 kb of sequence immediately downstream of the glh-2 stop 
codon. An Mlu I-Nhe I fragment containing the Mos transposase was subcloned into pJL9 to 
generate the glh-2: :MosTransposase construct. Iin-15(n765) hermaphrodites were injected with 
a Spe I - Kpn I fragment o f glh 2::MosTransposasc glh-2: :MosTransposase (injection 
concentration 10 ng/jil), with lin-15(+) (EKL15) and ofm l::ssfi> ofm-J::gfp (pPD97/98) 
fragments and N2 worm genomic DNA as described above for the generation of the 
oxExJ66fhsp: :MosTransposase]-&&&<fr ) array. 
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[0056] Transposase expression in the germline was determined by assaying for 
excision of transposons from a defined chromosomal location. Specifically, the Masi-containing 
extrachromosomal array was integrated into chromosome V to generate oxIs25[Mosl ;rol - 6(sd)]. 
ox!s25 [Mosl ;rol-6(sd)] . The oxIs25 array was mapped less than 0.54 map units from dpy-11. 
Heterozygous oxIsl2/dpy-l 1 worms were generated. These animals largely segregated Dpy and 
Rol progeny as expected for these closely linked markers (Figure 6). However, when the 
glh 2::Mos slh-2::Mos Transposase transgene was crossed in, approximately 16% of the 
nonDpy progeny were nonRol (15.7 % (15.7% ± 0.9, mean ± SEM, n=44 plates). The nonRol 
phenotype was stably inherited. It was hypothesized that in those worms, the Mos transposase 
excised-^feff^ - Mosl from the integrated array. The resulting DNA breaks were responsible for 
catastrophic excision of the entire locus, including interspersed rol-6 copies. The correlated loss 
of rol-6(sd) &ndi-Mes-l- Mosl was confirmed by PCR, in which NonRol individuals were cloned, 
selfed and DNA was purified from the progeny. Mosl containing Mas7-containing fragments 
were detected by PCR using one primer complementary to Mosl (oJL102: 
5 -CAACCTTGACTGTCGAACCACCATAG (SEQ ID NO: 1)) and one primer complementary 

to — mauritiana flanking — D. mauritiana-fiwkinz DNA (oJL104: 

ACAAAGAGCGAACGCAGACGAGT 5 -ACAAAGAGCGAACGCAGACGAGT (SEP ID 
NO: 2)). Of 188 nonRol worms, only one individual retained a copy of the Mos 1 Mosl 
fragment that was initially present in the transgene. Based on the phenotypic reversion of the 
Rol phenotype, it was calculated that 1 in 5 chromosomes experienced catastrophic excision of 
the transgene (20.9% ±1.1 %, mean ± SEM, n=41 plates) plates). The probability p of a single 
chromosome containing the array of Mosl elements transgene experiencing "catastrophic 
excision" can be derived from a Punnett square where the ratio R of nonRol worms over the total 
number of the progeny: R = l/4p + l/4p + (l/2p) 2 . These results demonstrated that the 
g7A-2-based expression vector expressed the transposase in the germline and that the Mos 1 
Mosl transposon in the chromosome was recognized as a substrate. 

[0057] To determine if excision o f Mos 1 Mosl from the array was associated with 
insertion in the genome, the progeny of animals expressing the transposase in the germline were 
screened for de novo insertions. Specifically, using PCR, the presence of the Mos 1 Mosl 
element in the absence of the Drosophila sequences which flank the transposon in the array was 
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assayed. Mosl presence was assessed by PCR using two primers located in the transposon 
(oJL102 (SEQ ID NO: 1) and oJL103: 5' TCTGCGAGTTGTTTTTGCGTTTGAG 
5 -TCTGCGAGTTGTTTTTGCGTTTGAG (SEP ID NO: 3)). The absence o f D. mauritiana 
flanking D. mauritiana-flankinz sequence was checked using oJL102 (SEQ ID NO: 1) and 
oJL104 (SEQ ID NO: 2) as described above. In addition, a PCR positive control was performed 
on each DNA sample using oligonucleotides located in the cha-1 gene. Because the integrated 
array containing immobilized transposons also contained rol-6(sd), insertions were sought in 
nonRol progeny; specifically, either nonRol animals that experienced catastrophic excision of the 
array or Dpy progeny (Figure 6B) were analyzed. Insertions were identified in 1% of nonRol 
progeny (2/227) and in 10 % 10% of Dpy progeny (11/116 F1+F2 Dpys). These results 
demonstrated that transposition o f Mos 1 Mosl could be achieved in the C. elegans germline. 
However, it was observed that high rates of excision were not accompanied by high rates of 
insertion; these results support previous data indicating that these two processes are not coupled. 
H. G. van Luenen, S. D. Colloms, R. H. Plasterk, Cell 79, 293 (1994). 

[0058] Using integrated arrays as a source of transposons prevents the easy recovery of 
new insertions that occur on the same chromosome; this bias could be circumvented by using an 
extrachromosomal array of transposons. In addition, extrachromosomal arrays are not 
completely stable in meiosis, which makes the isolation of strains lacking immobilized 
transposons easy after mobilization. Therefore, it was tested whethe r Mos 1 Mosl could be 
mobilized from an extrachromosomal array into the chromosomes. Specifically, the glh 2::Mos 
zlh-2::Mos Transposase construct was used to mobilize transposons from a Mos 1 b e aring 
Mos 1 -bearing arra y (oxExl6 4 fMosl; rol 6(sd)]). (oxEx 164 [Mosl: rol-6{sd)J). The nonRol 
progeny from double transgenic animals (oxExl67[glh 2::MosTransposas e ]; oxExl6 4 [Mosl; 
rol 6(sd)P (oxExl67[zlh-2::MosTransposaseJ: oxExl64[Mosl; rol-6fsd)J) w ere analyzed for 
transposition events using PCR. An insertion frequency of 1% (3 insertions/ 302 progeny, 
Table 1) was detected. Thus, these results closely match those obtained for integrated arrays. 

[0059] Table 1. Frequencies of Mosl genomic insertions from an extrachromosomal 
array. nonRol progeny of oxExl64[Mosl\ rol-6(sd)]\ oxExfMosTransposaseJ were analyzed by 
PCR for the presence of Mosl and the loss of the Drosophila flanking Drosophila-flankinz 



21 



sequences present in the donor plasmid. -Mes4 - Mosl p resence was assessed by PCR using two 
primers located in the transposon (oJL102 (SEQ ID NO: 1) and oJL103: — 
TCTGCGAGTTGTTTTTGCGTTTGAG 5'-TCTGCGAGTTGTTTTTGCGTTTGAG (SEP ED 
NO: 3)). The absence o f D. mauritiana flanking D. mauritiana-fiemkinz sequence was checked 
using oJL102 (SEQ ID NO: 1) and oJL104 (SEQ ID NO: 2) as described above. In addition, a 
PCR positive control was performed on each DNA sample using oligonucleotides located in the 
cha-1 gene. When h e at shock e d heat-shocked (1 hour at 35°C) POs were moved to fresh plates 
and eggs were collected for the next 24 hours. *During experiment-^ - #5, the stability of the 
oxExl64[Mosl\ rol-6(sd)J transgene reached 75 % 75%, while in previous e xp e riments 
experiments, it was approximately-20-% 7 20%. 



Transposase construct 


Transposition frequency 




no heat-shock 


with heat-shock 


glh-2: .MosTransposase 


exp#l: 2/108 1.9% 
exp#2: 0/104 0% 
exp#3: 1/90 1.1% 


exp#l: ND 
exp#2: ND 

exp#3: 0/65 0% 


hsp: :Mos Transposase 


exp#l: ND 
exp#2: 0/33 
exp#3: 0/39 
exp#4: ND 
exp#5: 0/44* 


exp#l: 4/65 6.2% 
exp#2: 3/98 3.1% 
exp#3: 6/87 6.8% 
exp#4: 5/85 5.9% 

exp#5: 15/34* 44.1% 



Example 4 - Mos 1 Mosl - Mobilization in the Germline Using a Heat-Shock Promoter 
[0060] The glh-2 promoter expresses the transposase in the germline constitutively. 
Constitutive expression of the transposase has two disadvantages. First, crosses must be set up 
fresh every generation to guarantee that the array remains intact and does not accumulate 
inherited changes. Second, because the tranposase was expressed in the germline early in 
development, events identified in the progeny might not be independent but might have occurred 
when the germline was still comprised of only few cells. Expression limited to adults can be 
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achieved by using a heat-shock promoter. Expression of the transposase could be induced after a 
strain containing the transposase and transposons had been propagated and expanded to many 
animals. In addition, h e at shocking heat-shocking animals with mature germlines would 
maximize the independence of insertion events. Animals expressing the transposase under the 
control of the heatshock heat-shock p romoter and bearing the integrated transposon 
(oxIs25/dpy 11; — oxEx 1 66[hsp: :Mos — Transposas e ]) — (ox!s25/dpv-l 1 ; oxExl 66[hsp::Mos 
Transposase!) were heat-shocked. POs could only be heat-shocked for 45 minutes; such animals 
were almost paralyzed, stopped eating and had low brood sizes. Longe r h e atshock heat-shock 
caused the animals to die. It was speculated that this lethality is due to high rates of transposition 
in somatic cells. Ubiquitous expression of the transposase would cause double strand breaks in 
the chromosome at the site of integration in every cell which may cause cell cycle arrest or 
apoptosis. G. Evan, T. Littlewood, Science 281, 1317 (1998). 

[0061] Fl progeny were analyzed for catastrophic excision, that is, for the appearance 
of nonRol nonDpy progeny (Figure 6A). In contrast with results obtained using the glh-2 
expression vector, catastrophic excision was not observed. Only rare nonRol progeny were 
generated which were likely to be the result of recombination between the array and the dpy 
marker (Figure 6B). However, by analyzing the Dpy progeny of heat-shocked animals, it was 
discovered that the h e atshock heat-shock construct caused the efficient insertion of Mos into 
new locations in the genome. Approximately 27% (25/94) of Fl or F2 Dpy worms carried novel 
transposon insertions (Figure 6B). In addition to novel insertions, recombinant chromosomes 
containing both the dpy marker and the original array arose at high frequency. These were 
identified as Dpy animals containing transposons flanked by Drosophila DNA. This hotspot for 
recombination is likely to arise as a result of double strand breaks introduced into the array by 
the transposase. A. R. Lohe, C. Timmons, I. Beerman, E. R. Lozovskaya, D. L. Hartl, Genetics 
154, 647 (2000). 

[0062] It was then tested whether transposition could occur from an extrachromosomal 
array using the h e atshock heat-shock p romoter construct to express the transposase (oxExl66; 
oxExl6 4 rMosl; rol 6(sd)1). oxExl64[Mosl: rol-6(sd)J). Hermaphrodites bearing both arrays 
were heat-shocked as young adults. The nonRol progeny were analyzed by PCR for 
transposition events. New insertions were observed in 8.9% of the Fl (33/369 progeny, 
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Table 1). Since transposition could have occurred into the transposas e — containing 
transposase-containing array, F2 animals that lost this array were isolated from eight 
transposon-bearing strains. Genomic DNA was prepared from these strains and analyzed for the 
presence of Mosl. The transposon was still detected in-aH-8 - all eight strains, thus demonstrating 
that the transposon had not inserted in the array. No insertions could be detected in 116 Fl 
clones derived from non-heat-shocked parents. The frequency of transposition was low but one 
of the main limiting factors is the stability of the extrachromosomal array that is used as a 
transposon source. The initial experiments were performed when the array was only 20% stable 
and transposition frequencies were in the range of 5%; when the array matured and was about-7£ 
%- 75% stable, transposition frequency reached 44%. Generating more stable extrachromosomal 
arrays could increase the frequencies of transposition. 

[0063] The h e atshock heat-shock p romoter was able to drive expression of Mos 
transposase in the germline and to promote transposition events at a higher rate than obtained 
using the glh-2 construct. Temperature has been shown to affect transposition frequency in other 
organisms. D. Garza, M. Medhora, A. Koga, D. L. Hartl, Genetics 128, 303 (1991). One 
possible explanation for the efficient transposition observed afte r h e atshock heat-shock is that 
chromatin structure is somehow altered by the h e atshock. heat-shock. Therefore, it was tested 
whethe r h e atshock heat-shock itself could account for the difference in transposition 
frequencies. Parents with extrachromosomal arrays carrying the g/A-2-transposase construct and 
carrying the transposon were h e atshock e d heat-shocked and progeny were tested for 
transposition. The frequency of transposition was not improved by the h e atshock heat-shock 
treatment (Table 1). Thus, h e atshock heat-shock itself does not facilitate efficient transposition. 

Example 5 - Transposition in Somatic Cells 
[0064] To determine whether the Mosl element could be mobilized in C. elegans cells, 
Mosl transposition in somatic cells was first analyzed. The gene encoding the Mosl transposase 
was engineered to improve expression in the worm and placed under the control of a heat-shock 
promoter. The Mos transposase encoding sequence was PGR amplified out of pBluescribe 
Mtt+IMosl (M. Medhora, K. Maruyama, D. L. Hartl, Genetics 128, 311 (1991), modified as 
described in Figure 5A and subcloned as a Mlu I Nho I Mlu I-Nhe I fragment between the 
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hsp- 16-48 promoter (H. G. van Luenen, S. D. Colloms, R. H. Plasterk, Embo J. 12, 2513 
(1993)); E. P. Candido, et al., Genome 31, 690 (1989) and the glh-2 3' untranslated region 
(fragment 35383 to 36190 in cosmid C55B7) (M. E. Gruidl, et al., Proa Natl Acad. Sci. USA 
93, 13837 (1996) (Figure 5A). The resulting construct was used to generate the 
extrachromosomal array oxExl 66 [hsp : :Mos Transposase] . — (lin 15(n765) — (lin-15(n765) 
hermaphrodites were injected in the syncitial gonad with a mixture of the following gel-purified 
fragments: a Hind III-Eco RI fragment of hsp::Mos Transposase (injection concentration: 10 
ng/^il), a Pst I-Bsi WI fragment of the ofm-l::gfp construct (pPD97/98) that expresses GFP in the 
coelomocytes (gift of Piali Sengupta) (injection concentration: 5 ng/nl) and a Kpn I-Eag I 
fragment of EKL15(/m-75 +) (S. G. Clark, X. Lu, H. R. Horvitz, Genetics 137, 987 (1994)) 
(injection concentration: 10 ng/^1). Plasmid backbones were removed from all purified 
fragments. Eco RV-digested N2 worm genomic DNA was coinjected at a concentration of 70 
ng/jil. Another extrachromosomal array, oxExl64[Mosl; rol-6(sd)] 9 contained the Mosl 
transposon. (Iin-15(n765) hermaphrodites were injected with a 2.2 kb Xho I-Hind III fragment 
of pBluescribe Ml3+/Mosl that contains the 1.3 kb Mosl element flanked by D. simulans 
sequences (M. Medhora, K. Maruyama, D. L. Haiti, Genetics 128, 311 (1991) (injection 
concentration: 10 ng/^il) and a 2.2 kb rol-6(sd) fragment of pRF4 (J. M. Kramer, R. P. French, 
E. C. Park, J. J. Johnson, Mol Cell Biol 10, 2081 (1990) (injection concentration: 10 ng/^il). 
Eco RV-digested N2 worm genomic DNA was coinjected at a concentration of 80 ng/jil to 
increase the complexity of the array; this array also contained the dominant genetic marker 
rol-6(sd) which causes animals to roll instead of swimming in a sinusoidal fashion. These two 
strains were crossed and progeny carrying both arrays were h e at shock e d heat-shocked as young 
adults. After 12 hours, the heat-shocked animals were harvested and genomic DNA was 
prepared. Mosl transposition was detected using the strategy developed by van Luenen et al. 
See H. G. van Luenen, S. D. Colloms, R. H. Plasterk, Embo J. 12, 2513 (1993). Specifically, 
insertions were identified by PCR amplification using one set of primers complementary to the 
transposon and another set complementary to an arbitrary target gene. DNA purification and 
PCR were performed as described in H. G. van Luenen, S. D. Colloms, R. H. Plasterk, Embo J, 
12, 2513 (1993). The primers in Mosl were 0JL88 (5 -CGCATGCGGCTTACTCAC (SEQ ID 
NO: 4)) first PCR; and oJL89 (5 '-GGCCCCATCCGATTACCACCTA (SEQ ID NO: 5)) second 
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PCR. Primers in unc-49 were oJL19 (5 '-GCGAAACGCATACCAACTGTA (SEQ ID NO: 6)) 
first PCR; and oJL20 (5 '-TTCATGCCGAAAAGCAGGCGT (SEQ ID NO: 7)) second PCR. 
Primers in gpa-2 were the same as described in H. G. van Luenen, S. D. Colloms, R. H. Plasterk, 
Embo J, 12, 2513 (1993). PCR products were gel-purified and sequenced using oJL89 (SEQ ID 
NO: 5) as a primer. A PCR product can be obtained only if a transposon has integrated into the 
target gene. The method is sensitive enough to detect a single insertion in the target gene in a 
single somatic cell of an adult animal Insertions in two genes were assayed: the gpa-2 gene 
which encodes a G protein subunit (R. R. Zwaal, J. E. Mendel, P. W. Sternberg, R. H. Plasterk, 
Genetics 145, 715 (1997)), and the unc-49 gene which encodes a GABA receptor. B. A. 
Bamber, A. A. Beg, R. E. Twyman, E. M. Jorgensen, J. Neurosci. 19, 5348 (1999). Mosl 
insertions were detected in both genes (2.5 ±1.0 inserts in 10 ng of genomic DNA, mean ± S.D., 
n=5 experiments). Given that the maximal distance of the inserts from our gene primers was 
approximatelv-4kbr - 1 kb, it was estimated that an average of 10 insertions occurred per cell in 
heat-shocked animals. Insertions were also detected at low frequency in worms that contained 
the transposon array but lacked the transposase expression construct (0.09 insertions in 10 ng 
DNA, n=2 experiments). These data indicated that low levels of Mosl tranposas e transposase 
were expressed from the intact Mosl transposons in the extrachromosomal array. 

[0065] To demonstrate that these transposon insertions represented bona fide 
transposition events, PCR products were gel-purified and the sequence of the insertion sites from 
the somatic transposition assays was determined. In all cases, the Mosl inverted terminal repeats 
were complete, the Drosophila sequences that flanked Mosl in the donor plasmid were no longer 
present, and the insertions all took place at a TA dinucleotide. Transposon insertions were 
distributed uniformly in exons, introns and 3' noncoding sequences of gpa-2 and unc-49 
(Figure 5B). Comparison of 22 insertion sites did not reveal a strong consensus site apart from a 
bias toward a T at position +1 with respect to the TA dinucleotide (Figure 5C). These data 
demonstrated that Mosl can hop into C. elegans chromosomes and that the transposase was 
sufficient to catalyze insertion without Drosophila host factors. 
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Example 6 - Introduction of Mariner Transposon Copies into the C. elegans Genome 
[0066] The full-length copy of the hornfly mariner transposon Autmar was gel-purified 
to remove non-nematode plasmid sequences. Purified Autmar was injected with linearized 
C elegans genomic DNA and the rol-6(dm) plasmid into Un-15(n765ts) worms and unstable 
transgenic strains were recovered. Due to the presence of rol-6(dm) in the array, transgenic 
animals roll instead of displaying normal sinusoidal locomotory movements. These animals are 
Lin when grown at the nonpermissive temperature because they are genotypically lin-15(-). This 
array was integrated into a chromosomal location to generate the oxh21 insertion. oxh21 was 
mapped to chromosome X, 2.5 m.u. away from the lon-2 locus. 

Example 7 -The Mariner Transposase Can Excise Mariner Transposons from C. Elesans 

Chromosomes in the Germline 

[0067] Engineered Himarl was inserted in the glh-2 germline expression cassette 
described above. The glh-2 ::Himarl construct was co-injected with linearized C. elegans 
genomic DNA and the lin-15(+) plasmid into Un-15(n765ts) worms. The oxExllS 
extrachromosomal array is transmitted at each generation to a large fraction of the progeny. 

[0068] lin-15(n 765ts)\ oxExlMzlh 2::Himarl;lin 15(+)1 oxExl 1 9 felh-2:: Himarl; 
lin-15(+)J males were crossed into Hn-15(n765ts) oxIs21 [Autmar; rol-6(sd)J hermaphrodites. 
As predicted, animals of the cross-progeny were Rol nonLin. At the next generation, it was 
expected that 1/3 of the Rol animals would be found to be homozygous for oxIs21. However, 
among 48 Rol nonLin cloned individuals, none segregated more than approximately 75% Rols, 
while 6 of 15 Rol Lin hermaphrodites segregated 100% Rol progeny. After careful 
characterization of the progeny of parent animals exhibiting various phenotypes, it was 
concluded that oxExllS could elicit the reversion of the Rol phenotype. Presumably, the 
reversion is caused by excision of the Autmar transposons from the integrated array which in 
turn leads to loss of the adjacent rol-6(dm) genes by imprecise repair of the locus. It was 
concluded that the mariner transposase can excise mariner transposons from C. elegans 
chromosomes in the germline. 
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Example 8 - Mobilization of a H e tcrolo g-«s- Heterologous Marine r Tran s po s h - Transposon 

in the C eleeans Genome 

Materials and Methods: 
Reagents 

[0069] Mosl containing M osl -containing strain: 

[0070] The transgenic strain EG 163 8 that contains Mosl has been generated by 
coinjection of Un-15(n765) worms with: 

the 2.2 kB Xho 1 Hind III Xho I-Hind III fragment of pBluescribe M 13+/Mosl, 
\3+/MosL M. Medhora et al., Genetics 128:311-318 (1991); (injection 
concentration: lOng/^il) 

the 2.2 kB Hind III rol~6 rescuing fragment containing the semi-dominant 
mutation rol-6(sul006), J. M. Kramer et al.. Mol Cell Biol Mol Cell Biol 
10:2081-2089 (1990); (injection concentration: 10 ng/^1) 

EcoRV dig e st e d EcoRV-digested g enomic DNA prepared from N2 worms 

(injectio n conc e n tration: concentration: 80 ng/^1) 
[0071] The resulting strain lin-15(ts)\ oxExl64[Mosl\ rol-6(sd)] exhibits a Rol Muv 
phenotype when grown above 20°C. The Muv phenotype is not expressed when worms are 
grown at 15°C. 

[0072] Mos 1 transposas e e xpr e ssing M osl transpose-expressing strain: 
[0073] As shown in Figure 2, the expression vector pJL44 (HSP::MosTas e ::glh 2) 
(HSP: :MosTase: :zlh-2) contains the following elements: 

a 377 bp Hsp 16-48 heat-shock promoter fragment recovered by PCR from 
pRP176, H. G. van Luenen et %\.*-EMBQ-J - EMBO J. 12:2513-2520 (1993), using 
the oligos oJL21 5 -CG AAGCTTGCTGGACGGAAATAGTGG (SEQ ID NO: 
19) and oJL22 5 '-CGACGCGTTCTTGAAGTTTAGAGAAT (SEQ ID NO: 20). 
a 1088 bp fragment containing the Mosl transposase coding sequence amplified 
by PCR from pBluescribe M 13+/Mool 13+/Mosl using oJL77— 
GCACGCGTTATGTCGAGTTTCGTGCCGAATAAAG 

5 -GCACGCGTTATGTCGAGTTTCGTGCCGAATAAAG (SEP ID NO: 21) 
and oJL78 
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GCGCTAGCTATTCAAAGTATTTGCCGTCGCTCGCGACACATTTTTCCCA 
5 -GCGCTAGCTATTCAAAGTATTTGCCGTCGCTCGCGACACATTTTTCC 

CA (SEP ID NO: 22). An artificial intron 

GTAAGTTTAAACATATATACTAACTAACCCATGGATTA 
TTTAAATTTTCAG 3' 

5 -GTAAGTTTAAACATATATACTAACTAACCCATGGATTATTTAAATTT 
TCAG-3_L(SEQ ID NO: 18) was inserted at position 264 with respect to the ATG. 
a 300 bp fragment containing the glh-2 3UTR (nt 3287 to 4087 with respect to 
glh-2 start codon) recovered form a 6.3 kb glh-2 genomic fragment subcloned in 
pBluescript KS (Stratagene) (M. E. Gruidl et al.. Proc Natl Acad Sci Proc. Natl. 
Acad. Sci. USA 93:13837-13842 (1996); gift of Karen Bennett, University of 
Missouri). 

[0074] The transgenic strain EG 1643 that contains the Mosl transposase expression 
vector has been generated by coinjection of Un-15(n765) worms with: 

the Hind III EcoRl Hind III-EcoRJ fragment of pJL44 (injection concentration: 
10 ng/ul) lOng/ul) 

the Eag I-Kpn I lin-15 genomic rescuing fragment from EKL15, S. L. Mclntire et 
al., Nature 389:870-876 (1997) 

the Pst I-BsiW I fragment of pPD97/98 that drives expression of the Green 
Fluorescent Protein in the coelomocytes (gift of Piali Sengupta, Brandeis 
University) (injection concentration: 10 ng/ul) 

EooRV dig e st e d EcoRV-digested g enomic DNA prepared from N2 worms 

(injection concentration: 80 ng/ul) 
[0075] The resulting strain lin-15(ts); oxExl66[hsp::MosTase:.glh-2; pPD97 198;-Hn- 
15(+)] lin-15(+)J has a wild-type phenotype. The presence of the extrachromosomal array 
causes expression of GFP in the coelomocytes which can be visualized using fluorescence 
microscopy. 
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Mobilization of the transposon in the C elegans genome 

[0076] Mobilization of Mosl was achieved by crossing the transposase-expressing 
strain into worms containing the Mos 1 Mosl transposon-containing array. lin~15(ts)\ 
oxExl66[hsp::MosTase:.~g!h-2; pPD97198\ lin-15(+)] hermaphrodites were crossed with N2 
males at 25°C. Non-Muv males lin-15(ts)\ oxExl66 were crossed with lin-15(ts)\ 
oxEx 164 [Mosl; rol-6(sd)] Rol non-Muv hermaphrodites previously grown at 15°C. 

[0077] The cross was kept at 20°C. Late L4 larvae or young adult Rol worms were 
transferred to a fresh plate and heat-shocked for 1 hour at 35°C. After 6 hours, non-Muv Rol P0 
animals (lin~15(ts)\ oxExl64\ oxExl66) were transferred to a fresh plate and allowed to lay eggs 
for 48 hours. A fraction of the Fl animals contain insertions o f Mos 1 Mosl in their genome 
and can be screened for mutant phenotypes. 

Identification of transposon insertion sites 

[0078] Mosl ins e rtion insertions were identified by inverse PCR, as shown in Figure 3. 
Genomic DNA was prepared according to standard procedure. Approximately 100 ng of 
genomic DNA was digested by Sau3A in a 10 ^1 volume for 3 to 14 hours. The restriction 
enzyme was inactivated by heating for 20 minutes at 70°C. Fragments were circularized b y s e lf - 
ligation self-ligation (overnight incubation at 15°C with 5 units of T4 DNA ligase). 

[0079] 3 of ligated DNA was used for PCR amplification. A first round of 

amplification was performed using the primers oJL103 & 1 - 

TCTGCGAGTTGTTTTTGCGTTTGAG 5 -TCTGCGAGTTGTTTTTGCGTTTGAG (SEP ID 

NO: 3) and oJL114 5' AAAGATTCAGAAGGTCGGTAGATGGG 

5 - AAAGATTC AGAAGGTCGGT AGATGGG (SEP ID NO: 10) (30 cycles,-4^ - 45 seconds at 
94°C 7-4^ - 1 minute at 60°C 7-1-4^ - 1 minute 15 seconds at 72°C, magnesium chloride 
concentration: 1.5 mM). The product of the first amplification was diluted 100-fold and 
subjected to a second round of amplification using the nested primers oJL115— 
GCTCAATTCGCGCCAAACTATG 5 '-GCTC AATTCGCGCC AAACTATG (SEP ID NP: 1 1) 
and 0JLII6 5' GAACGAGAGGCAGATGGAGAGG 5 -GAACGAGAGGCAGATGGAGAGG 
(SEQ ID NG: 12) (25 cycles,^ - 45 seconds at 94°C 7-4 ^ 1 minute at 62°C /VIS" 1 minute 15 
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seconds at 72°C, magnesium chloride concentration: 2.5 mM). Resulting fragments were run on 
an agarose gel, gel-purified and sequenced either directly or after subcloning. 

[0080] Figure 4 contains the sequence of an inverse PCR product demonstrating 
insertion of Mosl in chromosome X. Nucleotides in capital letters are from the Mosl 
transposon. C. elegans flanking C. e/ejg<ms-flanking g enomic region is in lower case. It 
matches the Y47C4.Contig215 sequence from chromosome X available at the Sanger Centre. 

[0081] Mosl, a mariner-like transposon isolated from Drosophila mauritiana was used. 
Transgenic worms containing Mosl in an extrachromosomal array were crossed with transgenic 
worms containing an expression vector in which a heat-shock promoter (hsp 16-48) drives the 
expression of the mos transposase (Figure 1). Cross-progeny containing both the Mosl 
transposon and the mos transposase were isolated. Heat-shock of these worms induced the 
expression of the transposase which in turn caused Mos 1 Mosl elements to transpose from the 
extrachromosomal array into the C. elegans genome. Five insertions were isolated, for a rate of 
one in seventeen animals analyzed. However, this array is only 20% stable per generation. 
Thus, th e r e in therein on average one transposition into chromosomes for every three germ cells 
exposed to the transposon. 

[0082] Some insertions will disrupt genes and cause mutant phenotypes. Mutant 
worms are outcrossed with wild-type worms containing no Mosl transposon. Since the insertion 
responsible for the mutation cosegregates with the mutant phenotype, it is possible to isolate the 
single relevant Mosl insertion after only a few outcrosses. Genomic DNA is then prepared from 
the outcrossed mutant. Regions flanking the transposon are recovered by inverse PCR and 
sequenced. Comparison of flanking sequences with the C. elegans genome sequence allows 
immediate identification of the mutated gene. This new mutagenesis system will significantly 
speed up the identification of genes of interest using C. elegans as a genetic model. 

Example 9 - Mosl Mosl Mutagenesis and Rapid Cloning of Genes 
[0083] In one embodiment, the method described in this invention is capable of 
generating mutations which can be rapidly cloned based on the Mosl Mosl unique DNA tag. 
To demonstrate that this is true, mutants have been identified and the relevant genes have been 
cloned using inverse PCR. Specifically, a morphological mutant in C. elegans was isolated 
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which causes the worms to be short and squat. Such mutations are called dumpy mutations and 
are given the three letter designation "dpy." A dumpy animal was identified after mobilization 
of the wild-type Mosl transposon. DNA was prepared, cleaved with the restriction enzyme 
Sau3A, and religated. Inverse PCR was performed using primers contained within the 
transposon but facing outward. The amplified fragment was sequenced. The Mosl element was 
inserted 175 nucleotides 5' of F54D8.1, which encodes a collagen protein. An inspection of the 
genetic map demonstrated that this insertion is in a chromosomal interval which also contains the 
dpy- 17 gene which had been previously defined by point mutations using chemical mutagens by 
Sydney Brenner in 1974. A complementation test was performed and the test demonstrated that 
this mutation was an allele of dpy-1 7. Thus, the method is capable of rapidly demonstrating the 
molecular identity of a gene which had remained unknown for almost 30 years. Mutants 
incapable of detecting high osmotic gradients (Osm) were also screened for. The first Osm 
mutant identified was cloned in a similar manner and proved to be an insertion of Mosl in 
exon 10 of the eat-4 gene. 



Example 10 - Targets of Transposase and Transposon 
[0084] For Mosl insertions to be useful for the cloning of mutated genes, the 
transposase must specifically mobilize Mosl and not other mariner elements. The C. elegans 
genome contains endogenous transposons. Apart from the most active Tel and Tc3 transposons, 
which are distantly related to Mosl, every haploid genome contains at least 55 copies of a 
Mariner Like Element (MLE), which is closely related to Mosl. M. M. Sedensky, S. J. Hudson, 
B. Everson, P. G. Morgan, Nucleic Acids Res. 22, 1719 (1994); H. M. Robertson, D. J. Lampe, 
Mol. Biol. Evol. 12, 850 (1995). Since in a few cases transposases of the Mariner family have 
been shown to cross-mobilize distinct but related transposons (P. Sundararajan, P. W. Atkinson, 
D. A. O'Brochta, Insect Mol Biol. 8, 359 (1999)), it was tested whether-Mes - Mos transposase 
expression had triggered transposition of the endogenous MLEs. Eight strains in which Mosl 
insertions had occurred were analyzed by Southern blot for changes in MLE distribution. No 
changes in MLE distribution were detected. Worm genomic DNA of Un~15(n765) and 
A/oy/-containing strains was extracted, Bgl II digested and run for Southern blot analysis using 
standard procedures. Oligos oJL132: 5 -ATATGCGGTGCGATGGGTGAG (SEQ ID NO: 8) 
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and oJL133: 5 -GGCGAACGCGATGAGAAGAAAG (SEQ ID NO: 9) were used to amplify a 
842 bp MLE fragment from N2 worm genomic DNA. The PCR product was sequenced and 
used for probe synth e sis, synthesis (data not shown) shown), indicating that Mos Mos 
transposase is specific for Mosl in the C. elegans germline. 

[0085] How many insertions occurred in every animal and what were their 
distributions? The number of chromosomal insertions per strain was determined by Southern 
blot analysis in eight insertion strains. Only one insertion per strain was detected (Figure 7A). 
To determine the location of the mobilized transposons, the left junctions of 17 insertions were 
cloned using inverse PCR. Approximately 100 ng of total genomic DNA was digested with 
Sau3A, self-ligated under dilute conditions, and then-^-% - 3% of the ligation was subjected to 
two rounds of nested PCR using the following primers: oJL103 (SEQ ID NO: 3)/oJL114 
5 -AAAGATTCAGAAGGTCGGTAGATGGG (SEQ ID NO: 10) (first PCR), oJL115 

5 -GCTCAATTCGCGCCAAACTATG (SEQ ID NO: 11) / 0JLII6 

GAACGAGAGGCAGATGGAGAGG 5 '-GAACGAGAGGCAGATGGAGAGG (SEP ID NO: 
12) (second PCR). PCR products were purified on agarose gel and sequenced using oJL115 
(SEQ ID NO: 11) as a primer. In agreement with the Southern blot experiments, only one 
insertion per strain was detected. Insertion sites were distributed on all six chromosomes 
(Figure 7B). Transposition occurred into exons, introns and intergenic regions (Table 2). 
Sequences flanking both sides of the transposon were determined for nine of the localized 
insertions. In each case, the inverted terminal repeats were complete and flanked by a TA 
dinucleotide that arose from the duplication of the original TA found in the genomic sequence 
(Figure 7C). 
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[0086] Table 2. Properties of Mosl genomic insertions. Mosl flanking were compared 
with the C elegans genome sequence. Physical location in the genome is given as the nucleotide 
position of the corresponding clone in the C elegans database (ACeDB). 



Isolation 
Name 


Transposon source 


Physical location 


Interpolated 
genetic location 


Genefinder 
predictions 


oxTil 


Extrachromosomal 


Y65B4BL @ 27,362 


LGI, -19 


Intergenic 


oxTi2 


Extrachromosomal 


Y44E3A @ 34,440 


LGI, -4.75 


Intergenic 


oxTi3 


Extrachromosomal 


M01E5 @ 19,740 


LGI, +29.9 


Intergenic 


oxTi4 


hxtracnromosomal 


1 13L2 @ 4,94o 


Lull, +0. 1 


bxon #4 oi 
F41G.12 


oxTiS 


Extrachromosomal 


K08E5@ 31,631 


LGIII, +4.61 


Intergenic 


OX 1 10 


Extrachromosomal 


TJOTT o A /7?\ a con 

ri23L24 @ 4,529 


LCjIV, +i.y 


Intergenic 


oxliy 


Extrachromosomal 


V (\0T\O A OO A 

KUoDo @ 4,234 


LCjIV, +0.6 


Intergenic 


oxTi8 


Extrachromosomal 


r> f\r\Ti c oo nor* 

R09B5 @ 22,929 


LGV, -19.0 


T"?„ 11/' _ C 

Exon #6 of 
R09B5.12 


oxliy 


Extrachromosomal 


Yo9ri2 @ 39,771 


T f^\7 in A€\ 

LGV, +17.49 


Intron #5 of 
Y69H2.4 


oxTilO 


Extrachromosomal 


Y47C4A 


LGX, -20 


Repeat 


oxTill 


Extrachromosomal 


C34E11 @ 12,022 


LGX, +6.55 


Exon #10 of 
C34E11.1 


oxTil2 


Integrated array 


Y71A12B@ 50,370 


LGI, +21 


Intergenic 


oxTil3 


Integrated array 


Y48G1C @ 19,916 


LGI, -19.8 


Intergenic 


oxTiH 


Integrated array 


C17F4® 22,793 


LGn, -8.06 


Exon #18 of 
gcy-19 


oxTilS 


Integrated array 


F35C5 @ 5,735 


LGII, +10 


Intergenic 


oxTil6 


Integrated array 


C06B3 @ 14,747 


LGV, +5.79 


Intergenic 


oxTi!7 


Integrated array 


R01H2@ 20,193 


LGIII, -0.86 


Intergenic 



[0087] Comparison of the insertion site sequences did not reveal a strong consensus 
motif for the target DNA. Molecular analysis of the insertions therefore demonstrated that Mosl 
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insertion obeyed properties previously observed for mariner class transposons. However, a 
formal possibility remained that Mosl hopped into the genomic DNA present in one of the 
extrachromosomal arrays and that recombination occurred subsequently between the array and 
the genome. To rule out this possibility, the insertion oxTi4 which was positioned 35 kb away 
from snt-1 was genetically mapped. In agreement with this physical location, oxTi4 was mapped 
less than 2.5 map units from snt-1: 20 Snt-1 individuals were cloned from the self-progeny of 
oxTi4/snt-l hermaphrodites. None of the mutants segregated oxTi4. The presence of oxTi4 was 
determined by PCR using one Mosl primer pointing towards the right end of the transposon 
(oJL89 (SEQ ID NO: 5)) and one primer in the genome (oJL129 
5'-CCAAATGCGTCTGTCCCACTC (SEQ ID NO: 13)). A PCR positive control was 
performed on each DNA sample using cha-1 primers. 

Example 11 - Remobilization of a Genomic Transposon Insertion 
[0088] The transposition events documented above were all excisions from an array of 
transposons residing in Drosophila DNA. To determine whether the transposase acts on a single 
Mosl transposon in a C elegans chromosome, the oxTi4 insert was remobilized. Primers for 
PCR were designed flanking the oxTi4 insertion. A first PCR round was performed with primers 
located 1671 nt upstream and 3144 bp downstream to oxTi 4 oxTi4 (respectively oJL149 
5 AAGTATGGCC AAACGACCCGAC AC (SEQ ID NO: 14) and oJL150 
5 -GCATTGGCACCTTTCTCCCTTCT (SEQ ID NO: 15)). A second round was performed 
using primers 493 bp upstream and 913 downstream to oxTi4 (respectively oJL145 

5 '- AC AGGC AGC ATTTTGTAGTCT (SEQ ID NO: 16) and oJL148 

AGGCTGCCTCGTAAGTTCCTAC AG 5 -AGGCTGCCTCGTAAGTTCCTACAG (SEP ID 
NO: 17)). Short PCR products were gel purified, subcloned and sequenced. The 
transposase-expressing transgene {oxExl 67 [glh-2: Transposase]) was crossed into animals 
homozygous for the oxTi4 insertion and DNAs from the progeny were analyzed for amplified 
fragments shorter than the insertion. These shorter PCR products represented a variety of 
excision events, including the three nucleotide excision footprint previously characterized for 
Mosl excisions (G. Bryan, D. Garza, D. Haiti, Genetics 125, 103 (1990)), as well as smaller 
footprints, excisions and even incomplete excisions (Table 3). Since these products could arise 
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from excision events in somatic cells, progeny animals that lost the transposase expression array 
were analyzed. Pools of 15 individuals from oxTi4\ oxExl 67 [glh-2: Transposase] progeny that 
lost the transposase array were transferred to fresh plates and allowed to lay eggs for 24 hours. 
Adult worms were then analyzed by a single round of PCR using the primers oJL145 (SEQ ID 
NO: 16)-oJL148 (SEQ ID NO: 17). Sixty individuals were cloned from the progeny of the pool 
exhibiting short PCR product and analyzed at the next generation to identify clones that lost 
oxTi4. One animal was identified among 954 progeny in which excision of the transposon had 
occurred. In this animal the excision left a 3 bp footprint and the duplicated TA dinucleotide 
which together resulted in a +2 frameshift. These data indicate that single copies of the Mosl 
Drosophila transposon can excise from C. elegans DNA in the germline to introduce frameshift 
or deletion mutations at the transposon insertion site. 
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Example 12: - Engineering Mutations in th e C. e l e gan s C. elegans Genome by Transeene 
Instructed DNA Double Strand Break Repai r Foil wing Mosj- Following Mosl Excision 
[0090] Germline expression of the Mos transposase under the control of the glh-2 
promoter causes reexcision of single copies of Mosl inserted in the C. elegans genome. 
Immobilization of the transposon causes a DNA double strand break (DSB) at the site of 
excision which is repaired by the cellular machinery. In 1992, R. Plasterk and J. Groenen 
(EMBO 7. 11:287) demonstrated that a DSB caused by excision of a Tel transposon in a 
mut-6(st702) background can be repaired using DNA contained in an extrachromosomal array 
that carries sequences homologous to the region of excision. As a result, sequences flanking the 
break can be replaced by sequences contained in the transgene. This strategy provides a way to 
engineer mutations in the genome. However, this approach never became a routine strategy 
probably because transposition is not controlled and excision occurs at low rates in such mutant 
strains. 

[0091] The controlled transposition of-Mesj - Mosl p rovides an efficient tool to use this 
strategy for engineering of the C. elegans genome: after a Mosl Mosl insertion has been 
identified in the gene of interest, a transgene is constructed with mutated sequences homologous 
to the region of insertion. The transgene that carries the glh 2::MosTransposas e glh-2: :Mos 
transposase expression vector is crossed into the starin strain that contain the Mosl Mosl 
genomic insertion and the template transgene. Expression of-Mos- Mos transposase causes-Me&l- 
Mosl excision and the progeny is screened by PCR for transgene instructed repair at the excision 
site (Figure 8). 

[0092] The feasibility of regulated mobilization of a heterologous transposon in the 
C elegans germline was thus demonstrated. The characteristics of Mosl transposition suggest 
that it could be used as a technique for tagging mutant genes. First, the Mos transposase does not 
activate transposition of endogenous transposons. Second, transposition of Mos in the germline 
is strictly dependent on the expression of the transposase. In this r e sp e ct respect, the use of a 
heat-shock promoter to express the transposase is if particular interest since it provides a 
convenient way to turn transposition on and off and to stabilize new inserts. Third, insertion 
sites of Mosl in the genome do not exhibit strong sequence bias. Transposons were inserted into 
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exons, introns and intergenic regions. Comparison of the insertion sites did not reveal a strong 
consensus sequence apart from the TA dinucleotide. Fourth, excision and insertion frequencies 
can be differentially manipulated by expressing the transposase under the control of different 
promoters. The h e atshock heat-shock p romoter caused very low rates of excision and loss of the 
transposon array but high rates of transposon insertion. The glh-2 promoter construct caused a 
low rate of insertion but a high rate of excision and loss of the transposon array. Since 
transposon insertions frequently do not disrupt gene function in C elegans even if the insertion 
occurs in an exon (A. M. Rushforth, B. Saari, P. Anderson, Mol Cell Biol 13, 902 (1993); A. 
M. Rushforth, P. Anderson, Mol Cell Biol 16, 422 (1996)), transposons are usually remobilized 
to generate deletion alleles (D. Eide, P. Anderson, Mol Cell Biol 8, 737 (1988); R. R. Zwaal, A. 
Broeks, J. van Meurs, J. T. Groenen, R. H. Plasterk, Proc. Natl Acad. Set USA 90, 7431 
(1993)). It was thus demonstrated that the glh-2 expression construct can be used to generate 
deletion alleles of the genes containing Mosl insertions. 

[0093] Mosl transposition in C. elegans will allow the development of two new genetic 
tools. First, mutations identified in forward screens using Mosl will allow the rapid cloning of 
the mutated gene. Second, a library of insertions localized in the genome could be generated; the 
glh-2 expression construct could then be used to remobilize these insertions at high frequency 
and generate deletion and frameshift mutations in genes of interest. 
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ABSTRACT 

The present invention provides methods for generating and identifying mutations in 
Caenorhabditis elegans. Disclosed herein are methods for introducing DNA into the C. elegans 
germline, methods for expressing DNA in the C elegans germline, and methods for generating 
C. elegans mutants by using controlled mobilization of endogenous and heterologous 
transposons. Also disclosed are a transgene construct for expression in C. elegans and methods 
for inserting single copy DNA sequences into the C elegans germline, and engineering 
mutations into the C. elegans genome. 
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SEQUENCE LISTING 

<110> Universtiy of Utah Research Foundation 
Bessereau, Jean-Louis 
Jorgensen, Erik 

<120> Method of Transposon-Mediated Mutagenesis in the Nematode 
Caenorhabditis Elegans 

<130> 0274-6277. 1US 

<140> US 09/980,644 
<141> 2001-11-01 

<150> PCT/US00/40091 
<151> 2000-06-01 

<150> US 60/136,972 
<151> 1999-06-01 

<160> 56 

<170> Patentln version 3.2 

<210> 1 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer OJL102 
<400> 1 

caaccttgac tgtcgaacca ccatag 



<210> 2 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer OJL104 

<400> 2 

acaaagagcg aacgcagacg agt 



<210> 3 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 
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<223> Primer OJL103 



<400> 3 

tctgcgagtt gtttttgcgt ttgag 



<210> 4 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 0JL88 

<400> 4 

cgcatgcggc ttactcac 



<210> 5 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer OJL89 

<400> 5 

ggccccatcc gattaccacc ta 



<210> 6 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer OJL19 

<400> 6 

gcgaaacgca taccaactgt a 

<210> 7 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer OJL2 0 

<400> 7 

ttcatgccga aaagcaggcg t 



<210> 8 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer OJL132 

<400> 8 

atatgcggtg cgatgggtga g 



<210> 9 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer OJL133 

<400> 9 

ggcgaacgcg atgagaagaa ag 



<210> 10 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer OJL114 

<400> 10 

aaagattcag aaggtcggta gatggg 



<210> 11 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer OJL115 

<400> 11 

gctcaattcg cgccaaacta tg 

<210> 12 

<211> 24 

<212> DNA 

<213> Artificial 



<223> Primer 0JLII6 



<400> 12 

gaacgagaga ggcagatgga gagg 

<210> 13 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer OJL129 
<400> 13 

ccaaatgcgt ctgtcccact c 



<210> 14 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer OJL149 

<400> 14 

aagtatggcc aaacgacccg acac 

<210> 15 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> primer OJL150 

<400> 15 

gcattggcac ctttctccct tct 



<210> 16 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> primer OJL145 

<400> 16 

acaggcagca ttttgtagtc t 



<210> 17 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> primer OJL148 

<400> 17 

aggctgcctc gtaagttcct acag 



<210> 18 

<211> 51 

<212> DNA 

<213> Artificial 

<220> 

<223> artificial intron 

<400> 18 

gtaagtttaa acatatatac taactaaccc atggattatt taaattttca 



<210> 19 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> primer oJL21 

<400> 19 

cgaagcttgc tggacggaaa tagtgg 

<210> 20 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> primer OJL22 

<400> 20 

cgacgcgttc ttgaagttta gagaat 

<210> 21 

<211> 34 

<212> DNA 

<213> Artificial 

<220> 
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<223> primer oJL77 
<400> 21 

gcacgcgtta tgtcgagttt cgtgccgaat aaag 



<210> 22 

<211> 49 

<212> DNA 

<213> Artificial 



34 



<220> 

<223> primer oJL78 
<400> 22 

gcgctagcta ttcaaagtat ttgccgtcgc tcgcgacaca tttttccca 4 9 



<210> 23 

<211> 347 

<212> DNA 

<213> Artificial 

<220> 

<223> inverse PCR product 
<400> 23 

cagtcaaggt tgacacttac aaggtcaaag ttttatgaca atcgataaat atttacgttt 60 

gcgagacatc tatatgttcg aaccgacatt ccctacttgt acacctggta aatgaaagct 120 

ggtgacgtgg agattacgtc cccgtaaaaa ttattgcgaa atatgcaacg gtggccgaga 180 

aaatccgcga ccccgtcgac ccagacacgg ttgattctcc agtgacggtc gatcaacaaa 24 0 

aaagatccat ttttcatctc cagtaacgat acgatgcaaa aacgacttcc ttttgtatcg 300 

tgaaagcaaa atttcgcatg tgtttttgcg cctctccatc tgcctct 347 



<210> 24 

<211> 21 

<212> DNA 

<213> Drosophila mauritiana 

<400> 24 

gcagtcaaca tgtcgagttt c 21 

<210> 25 

<211> 18 

<212> DNA 

<213> Drosophila mauritiana 
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<400> 25 

gatgctcaaa cgcaaaaa 



18 



<210> 26 
<211> 20 
<212> DNA 

<213> Drosophila mauritiana 
<400> 26 

tttgaataaa tgattttttc 2 0 



<210> 27 

<211> 4 

<212> PRT 

<213> Drosophila mauritiana 

<400> 27 

Met Ser Ser Phe 

1 



<210> 28 
<211> 6 
<212> PRT 



<213> Drosophila mauritiana 
<400> 28 

Asp Ala Gin Thr Gin Lys 



<210> 29 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> oxTil insertion of Mosl into C. elegans genome 

<400> 29 

gtttagcgac gagtgacata ccaggtgtac 3 0 



<210> 30 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> oxTil insertion of Mosl into C. elegans genome 

<400> 30 
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gtacacctga taattctccg aaagcttcag 



30 



<210> 31 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> oxTi2 insertion of Mosl into C. elegans genome 

<400> 31 

tcgataaata aattatttta ccaggtgtac 30 



<210> 32 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> oxTi2 insertion of Mosl into C. elegans genome 

<400> 32 

gtacacctga taattctatc caaaaatcgc 30 



<210> 33 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> oxTi3 insertion of Mosl into C. elegans genome 

<400> 33 

aaagtagtgg atgcgatata ccaggtgtac 3 0 



<210> 34 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> oxTi3 insertion of Mosl into C. elegans genome 

<400> 34 

gtacacctga taataagaga ggcgaaggat 3 0 



<210> 35 
<211> 30 
<212> DNA 
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<213> Artificial 
<220> 

<223> oxTi4 insertion of Mosl into C. elegans genome 
<400> 35 

tcctcttttc cagacgagta ccaggtgtac 30 



<210> 36 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> oxTi4 insertion of Mosl into C. elegans genome 

<400> 36 

gtacacctga tatatccttt tgttccttgc 30 



<210> 37 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> oxTiS insertion of Mosl into C. elegans genome 

<400> 37 

gtcggacaat cagaagtgta ccaggtgtac 3 0 



<210> 38 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> oxTiS insertion of Mosl into C. elegans genome 

<400> 38 

gtacacctga taagaactaa aaggacaccg 3 0 



<210> 39 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> oxTi6 insertion of Mosl into C. elegans genome 

<400> 39 
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ttgaacaata aatactaata ccaggtgtac 



<210> 40 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> oxTiS insertion of Mosl into C. elegans genome 

<400> 40 

gtacacctga tattgttgtc ctcaagattt 



<210> 41 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> oxTi8 insertion of Mosl into C. elegans genome 

<400> 41 

gacgcaataa atccacaata ccaggtgtac 

<210> 42 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> oxTi8 insertion of Mosl into C. elegans genome 

<400> 42 

gtacacctga taattttccc gactcttaca 



<210> 43 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> oxTi9 insertion of Mosl into C. elegans genome 

<400> 43 

ccctctccaa tagtctagta ccaggtgtac 

<210> 44 

<211> 30 
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<212> DNA 

<213> Artificial 

<220> 

<223> oxTi9 insertion of Mosl into C. elegans genome 
<400> 44 

gtacacctga taaatgtcat cagaattcat 



<210> 45 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> oxTill insertion of Mosl into C. elegans genome 

<400> 45 

accaaaagca aaaacactta ccaggtgtac 

<210> 46 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> oxTill insertion of Mosl into C. elegans genome 

<400> 46 

gtacacctga taaccaaatg atgggtggca 



<210> 47 

<211> 28 

<212> DNA 

<213> Artificial 

<220> 

<223> oxTi4 insertion of Mosl 

<400> 47 

ctcttttcca gacgagtacc aggtgtac 

<210> 48 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> oxTi4 insertion of Mosl 
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<400> 48 

tacacctgat atatcctttt gttcctt 
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<210> 49 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> lesion generated after removal of Mosl in oxTi4 insertion 

<400> 49 

ctcttttcca gacgagtata tatccttttg ttcctt 



<210> 50 

<211> 37 

<212> DNA 

<213> Artificial 

<220> 

<223> lesion generated after removal of Mosl in oxTi4 insertion 

<400> 50 

ctcttttcca gacgagtaat atatcctttt gttcctt 



<210> 51 

<211> 39 

<212> DNA 

<213> Artificial 

<220> 

<223> lesion generated after removal of Mosl in oxTi4 insertion 

<400> 51 

ctcttttcca gacgagtatg atatatcctt ttgttcctt 

<210> 52 

<211> 37 

<212> DNA 

<213> Artificial 

<220> 

<223> lesion generated after removal of Mosl in oxTi4 insertion 

<400> 52 

ctcttttcca gacgagtact atatcctttt gttcctt 



<210> 53 
<211> 21 
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<212> DNA 

<213> Artificial 

<220> 

<223> lesion generated after removal of Mosl in oxTi4 insertion 
<400> 53 

tgatatatcc ttttgttcct t 



<210> 54 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> lesion generated after removal of Mosl in oxTi4 insertion 

<400> 54 

ctcttttcca gacgaga 



<210> 55 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> lesion generated after removal of Mosl in oxTi4 insertion 

<400> 55 

ctcttttcca gacgagta 



<210> 56 

<211> 66 

<212> DNA 

<213> Artificial 

<220> 

<22 3> lesion generated after removal of Mosl in oxTi4 insertion 

<400> 56 

ctcttttcca gacgagtaat tgtttactct cagtgcagtc aacatgtcga tatccttttg 
ttcctt 
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